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RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Application Serial Number: 




Source: \tnQ0 



Date Processed by STIC: 'AlWl 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS, 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703- 308-4216. 
PATENTIN 2.1 e-mail help: patin21heIp(5) y uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.1 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/ofrices/pac/checker 

Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there 

a possibility that the disk/CD-Rom may nave been affected by treatment given to all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the 

United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm> , EFS Submission 
User Manual - ePAVE) 

2. U.S. Postal Service: U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202 

3. Hand Carry directly to: 

U.S. Patent and Trademark Office, Technology Center 1600, Reception Area, 7 th Floor, Examiner Name, 
Sequence Information, Crystal Mail One, 1911 South Clark Street, Arlington, VA 22202 



U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two, 
20 1 1 South Clark Place, Arlington, VA 22202 
4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office, 
Box Sequerfce, Room 1B03-Mailroom, Crystal Plaza Two, 201 1 South Clark Place, Arlington, VA 22202 



Or 



Revised 01/29/2002 



Raw Sequence Listing Error Summary 



ERROR DETECTED 



SUGGESTED CORRECTION' 



SERIAL NUMBER: 



03 Is3 7 t 7/0 



ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA** HEADERS, WHICH WERE INSERTED BV PTO SOFTWARE 



1 Wrapped Nucleics 

Wrapped Aminos 



10 



The numberAext at the end of each line "wrapped" down to Ihe next line. This may occur if your file 
was retrieved in a word processor after creating it Please adjust your right margin to .3; this will 
prevent "wrapping." 



_Invalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 



_Misaligned Amino The numbering under each 5 th amino acid is misaligned. Do not use tab codes between numbers; 
Numbering use space characters, instead. 

_Non- ASCII The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 

ensure your subsequent submission Is saved in ASCII text 

_Variable Length Sequences) contain n's or Xaa*s representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

_PatentIn 2.0 A "bug** in Pat en tin version 2.0 has caused the <220>-<223> section to be missing from amino acid 

"bug" sequencers) " . Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections Tor 
Artificial or Unknown sequences. 

_Skipped Sequences Sequences) 



CD 

m 

i 



(OLD RULES) 



missing. If intentional, please insert the following lines for each skipped sequence: 
(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTIONS EQ ID NO:X: (insert SEQ ID NO where "X"> shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:** response to include the skipped sequences. 



8 Skipped Sequences 

(NEW RULES) 



Sequences) _ 



missing. If Intentional, please insert the following lines for each skipped sequence. 



\/ Use of n*s < 



t or Xaa's 
(NEW RULES) 



Invalid <213> 



Response 



Useof<220> 



<2I0> sequence id number 
<400> sequence id number 
000 

Use of n's and/or Xaa's have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n*s or Xaa's are present 
In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 
>* /* *• 

Per 1.823 of Sequence iftles, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
is Artificial Sequence 



Sequencers) _ 



w missing the <220> "Feature** and associated numeric identifiers and responses. 



12 Patentln 2.0 * 

"bug** 



Use of <220> to <223> is MANDATORY if <213> "Organism** response is "Artificial Sequence** or 
"Unknown.** Please explain source of genetic material in <220> to <223> section. 
(See "Federal Register,** 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sea 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, . 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager** or any other manual means to copy file to floppy disk. 



8 
3 



1 3 Misuse of n n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to represent 

any value not specifically a nucleotide. 

AMC/MH - Biotechnology Systems Branch - 08/21/2001 
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Dots Not Comply 
9§ffeetec5 Diskette Needed 



1600 



RAW SEQUENCE LISTING 

PATENT. APPLICATION: US/09/537,710 



DATE: 04/08/2002 
TIME: 10:57:03 



Input Set : A:\EP.txt 

Output Set: N:\CRF3\04082002\l537710.raw 



3 <110> 

4 

5 

6 

7 

9 <120> 
production 
10 
11 

13 <130> 
15 <140> 
17 <141> 

19 <150> 

20 <151> 
22 <160> 
24 <170> 



APPLICANT: Dahlqvist, Andres 
Stahl, Ulf 
Lenman, Marit 
Banas, Antoni 
Ronne, Hans 

TITLE OF INVENTION: A new class of enzymes in the biosynthetic pathway for the 
of 

triacylglycerol and recombinant DNA molecules encoding these enzymes 
FILE REFERENCE: BASFnae337799PCTl- 15 
CURRENT APPLICATION NUMBER: US 09/537,710 
CURRENT FILING DATE: 2000-03-30 
PRIOR APPLICATION NUMBER: EP 99106656.4 
PRIOR FILING DATE: 1999-04-01 
NUMBER OF SEQ ID NOS : 31 
SOFTWARE: WordPerfect version 6.1 



ERRORED SEQUENCES 



457 <210> SEQ ID NO : 5 

458 <211> LENGTH: 2427 

459 <212> TYPE: DNA 

460 <213> ORGANISM: Arabi 

462 <400> SEQUENCE: 5 

463 agaaacagct ctttgtctct 

464 ctggacgaga tttgacaaag 

465 tgccccttat tcatcggaaa 

466 tggtgcacga tgaggattcg 

467 aatcgaacgg aggagggaag 
4 68 tgtgtgtaac ctggtggttt 
4 69 agtatgtaac ggagcgaatc 

470 aaagaaggtc ttaaggcgaa 

471 ctcgagcttt gggaaggcaa 

472 ggaacttttg gtgaagtcta 

473 aatgaaactg ggttggatcc 

474 gctgactact ttgctcctgg 

475 attggatatg aagagaaaaa 
47 6 aacacagagg tacgtgatca 

477 tctaccaacg gtggaaaaaa 

478 ctacatttta tgaagtgggt 

479 tggtgtgcaa agtatattaa 

480 aaagctgttg cagggctttt 

481 gccccaggat tcttagacac 



sequences for similar errors. 



ctcgactgat 
tccgtatagc 
aagccgacgg 
caaaagaaac 
tggtcgtgca 
cttctcttcc 
acgggtcctt 
acatcctgtt 
acaatgcgct 
caaaaggcct 
agctggtatt 
ctactttgtc 
tatgtacatg 
gactcttagc 
agcagttata 
tgaggcacca 
ggcggtgatg 
ctctgctgaa 
cgatatattt 




The type of errors shown exist throughout ^""^ j A v 
the Sequence Listing. Please check subsequent XrJ^A^ 

dopsis thaliana^-""" ^\iJH^^^ 



ctaacaatcc 
ttaacctggt 
agaaaccatc 
cacacgaatc 
tcgattcttg 
tttacaacgc 
tgcctgaccc 
gtcttcattc 
gatggtttat 
ctatgttggg 
agagttcgag 
tgggcagtgc 
gctgcatatg 
cgtatgaaaa 
gttccgcatt 
gctcctctgg 
aacattggtg 
gcaaaggatg 
agacttcaga 



ctaatctgtg 
ttaatttcaa 
gacgccgcca 
ttccaaatcc 
ttgttggttc 
aatgcctgcg 
gcccggtgtt 
ctgggattgt 
ttagaaaacg 
tggaacacat 
ctgtatcagg 
tgattgctaa 
actggcggct 
gtaatataga 
ccatgggggt 
gtggcggggg 
gaccatttct 
ttgcagttgc 
ccttgcagca 



ttctaaattc 
gtgacagata 
tctgaagagg 
caccataaga 
attgggtgtg 
agcttccctc 
aagctcaaaa 
caccggtggg 
tttgtggggt 
gtcacttgac 
actcgtggct 
ccttgcacat 
ttcgtttcag 
gttgatggtt 
cttgtatttt 
tgggccagat 
tggtgttcca 
cagagcgatt 
tgtaatgaga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 



file://C:\CRF3\Outhold\VsrI537710.htm 
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482 
483 
484 
485 
486 
487 
488 
489 
490 
491 
492 
493 
494 
495 
496 
497 
498 
499 
500 
501 

E--> 502 
503 
719 
720 
721 
722 
724 

E--> 725 

E--> 726 
727 

E--> 728 
729 
730 
731 
732 
733 
734 
735 
866 
867 
868 
869 

E--> 871 
872 
873 
875 
876 
878 
879 



RAW SEQUENCE LISTING 
PATENT APPLICATION: US/09/537,710 

Input Set : A:\EP.txt 

Output Set: N:\CRF3\04082002\I537710.raw 



atgacacgca catgggactc aacaatgtct atgttaccga agggaggtga 
ggcgggcttg attggtcacc ggagaaaggc cacacctgtt gtgggaaaaa 
aacgaaactt gtggtgaagc aggtgaaaac ggagtttcca agaaaagtcc 
ggaaggatga tatcttttgg gaaagaagta gcagaggctg cgccatctga 
attgattttc gaggtgctgt caaaggtcag agtatcccaa atcacacctg 
tggacagagt accatgacat gggaattgct gggatcaaag ctatcgctga 
tacactgctg gtgaagctat agatctacta cattatgttg ctcctaagat 
ggtgccgctc atttctctta tggaattgct gatgatttgg atgacaccaa 
cccaaatact ggtcaaatcc gttagagaca aaattaccga atgctcctga 
tactcattat acggagtggg gataccaacg gaacgagcat acgtatacaa 
tctcccgaca gttgcatccc ctttcagata ttcacttctg ctcacgagga 
agctgtctga aagcaggagt ttacaatgtg gatggggatg aaacagtacc 
gccgggtaca tgtgtgcaaa agcgtggcgt ggcaagacaa gattcaaccc 
aagacttata taagagaata caatcactct ccgccggcta acctgttgga 
acgcagagtg gtgcccatgt tgatatcatg ggaaactttg ctttgatcga 
agggttgccg ccggaggtaa cgggtctgat ataggacatg accaggtcca 
tttgaatggt cggagcgtat tgacctgaag ctgtgaatat catgatctct 
ctgtcagctt atgtgaatcc aatactttga aagagagatc atcatcaatt 
gtcatcatca tgatgctcaa ctcacaaaga agcctgagaa tgatactttg 
ctcaatacct ctttaatatt cttattgaat gtaaattata caatcctatc 
acgataacac aaaacttgct gcngccat.gt ttgtttgtct tgtcaaaagc 
gggttaaaaa aaaaaaaaaa aaVaaaa 
<210> SEQ ID NO: 9 
<211> LENGTH: 616 

<212> TYPE: DNA COiK^ 
<213> ORGANISM: Neurospora crassa 
<400> SEQUENCE: -9 f 

ggtggcgaag acgMg&cgg aagttggagg ctaacgagaa tga^n^tcgg 
accctctaga gacaCgacta c<m^.tgcacc cagcctcaag gtpta<£igt.t 

ggaagccgac ggagcgagcc tacatctatc tggcgcccga tcticgggacg 
tttagatgac gatcgatacg actttgactmj aggggcacat tgaccacggt 

gcgaaggcga tggcacagtg aaccttatga gtttggggta cctgtgcaat 
aaatgaagag atacaatcct gcgggctcaa aaataaccgt ggtcgagatg 
cagaacggtt caatccgaga ggagggccga atacggcgga tcacgtggat 
ggcagaatct aaacgagtac attcttaaag tggcggcagg tcgaggcgat 
attttattac tagtaatatt cttaaatatg tagaaaaggt tgaaatttat 
taaatacggc acataggtta ctcaatagta tgactaatta aaaaaaaatt 
aaaaaaaaaa aaaaa, 



DATE: 04/08/2002 
TIME: 10:57:03 



cacgatatgg 
gcaaaagaac 
tgttaactat 
gattaataat 
tcgtgacgtg 
gtataaggtc 
gatggcgcgt 
gtatcaagat 
gatggaaatc 
gcttaaccag 
ggacgaagat 
cgtcctaagt 
ttccggaatc 
agggcgcggg 
agatatcatg 
ctctggcata 
ttaagctgtc 
catcatcatc 
gtgcgaaatt 
taatgtttga 
atcaatttgt 



1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2427 



agatggatct 
tnjtatgggta 

acaacgcatc 
gtgattttgg 

aaggggtgga 
ccgcatgaac 
attctaggaa 
acaattgagg 
gaagagtaat 
ttttttctaa 



60 
120 

180 
240 

300 
360 
420 
480 
540 
600 
616 



<210> SEQ ID NO: V13;r^ >n 
<211> LENGTH: 623 V\ 5^ ^ WQS dlrfrlW ! 

<212> TYPE: PRT 

<213> ORGANISM: Soliizoyaccharomyces pombe 
<4 00> SEQUENCE :(T?^ 
Met Ala Ser Ser Lys Lys Ser Lys Thr His Lys Lys Lys Lys Glu Val 

15 10 15 

Lys Ser Pro lie Asp Leu Pro Asn Ser Lys Lys Pro Thr Arg Ala Leu 

20 25 30 

Ser Glu Gin Pro Ser Ala Ser Glu Thr Gin Ser Val Ser Asn Lys Ser 
35 40 45 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/537 f 710 



DATE: 04/08/2002 
TIME: 10:57:03 



Input Set : A:\EP.txt 

Output Set: N:\CRF3\04082002\l537710.raw 



881 


Arg 


Lys 


Ser 


Lys 


Phe 


Gly 


Lys 


Arg 


Leu 


Asn 


Phe 


He 


Leu 


Gly 


Ala 


lie 


882 




50 










55 










60 










884 


Leu 


Gly 


He 


Cys 


Gly 


Ala 


Phe 


Phe 


Phe 


Ala 


Val 


Gly 


Asp 


Asp 


Asn 


Ala 


885 


65 










70 










75 










80 


887 


Val 


Phe 


Asp 


Pro 


Ala 


Thr 


Leu 


Asp 


Lys 


Phe 


Gly 


Asn 


Met 


Leu 


Gly 


Ser 


888 










85 










90 










95 




890 


Ser 


Asp 


Leu 


Phe 


Asp 


Asp 


He 


Lys 


Gly 


Tyr 


Leu 


Ser 


Tyr 


Asn 


Val 


Phe 


891 








100 










105 










110 






893 


Lys 


Asp 


Ala 


Pro 


Phe 


Thr 


Thr 


Asp 


Lys 


Pro 


Ser 


Gin 


Ser 


Pro 


Ser 


Gly 


894 






115 










120 










125 








896 


Asn 


Glu 


Val 


Gin 


Val 


Gly 


Leu 


Asp 


Met 


Tyr 


Asn 


Glu 


Gly 


Tyr 


Arg 


Ser 


897 




130 










135 










140 










899 


Asp 


His 


Pro 


Val 


He 


Met 


Val 


Pro 


Gly 


Val 


He 


Ser 


Ser 


Gly 


Leu 


Glu 


900 


145 










150 










155 










160 


902 


Ser 


Trp 


Ser 


Phe 


Asn 


Asn 


Cys 


Ser 


He 


Pro 


Tyr 


Phe 


Arg 


Lys 


Arg 


Leu 


903 










165 










170 










175 




905 


Trp 


Gly 


Ser 


Trp 


Ser 


Met 


Leu 


Lys 


Ala 


Met 


Phe 


Leu 


Asp 


Lys 


Gin 


Cys 


906 








180 










185 










190 






908 


Trp 


Leu 


Glu 


His 


Leu 


Met 


Leu 


Asp 


Lys 


Lys 


Thr 


Gly 


Leu 


Asp 


Pro 


Lys 


909 






195 










200 










205 








911 


Gly 


He 


Lys 


Leu 


Arg 


Ala 


Ala 


Gin 


Gly 


Phe 


Glu 


Ala 


Ala 


Asp 


Phe 


Phe 


912 




210 










215 










220 










914 


He 


Thr 


Gly 


Tyr 


Trp 


He 


Trp 


Ser 


Lys 


Val 


He 


Glu 


Asn 


Leu 


Ala 


Ala 


915 


225 










230 










235 










240 


917 


He 


Gly 


Tyr 


Glu 


Pro 


Asn 


Asn 


Met 


Leu 


Ser 


Ala 


Ser 


Tyr 


Asp 


Trp 


Arg 


918 










245 










250 










255 




920 


Leu 


Ser 


Tyr 


Ala 


Asn 


Leu 


Glu 


Glu 


Arg 


Asp 


Lys 


Tyr 


Phe 


Ser 


Lys 


Leu 


921 








260 










265 










270 






923 


Lys 


Met 


Phe 


He 


Glu 


Tyr 


Ser 


Asn 


He 


Val 


His 


Lys 


Lys 


L ys 


Val 


Val 


924 






275 










280 










285 








926 


Leu 


He 


Ser 


His 


Ser 


Met 


Gly 


Ser 


Gin 


Val 


Thr 


Tyr 


Tyr 


Phe 


Phe 


Lys 


927 




290 










295 










300 










929 


Trp 


Val 


Glu 


Ala 


Glu 


Gly 


Tyr 


Gly 


Asn 


Gly 


Gly 


Pro 


Thr 


Trp 


Val 


Asn 


930 


305 










310 










315 










320 


932 


Asp 


His 


He 


Glu 


Ala 


Phe 


He 


Asn 


He 


Ser 


Gly 


Ser 


Leu 


He 


Gly 


Ala 


933 










325 










330 










335 




935 


Pro 


Lys 


Thr 


Val 


Ala 


Ala 


Leu 


Leu 


Ser 


Gly 


Glu 


Met 


Lys 


Asp 


Thr 


Gly 


936 








340 










345 










350 






938 


He 


Val 


He 


Thr 


Leu 


Asn 


He 


Leu 


Glu 


Lys 


Phe 


Phe 


Ser 


Arg 


Ser 


Glu 


939 






355 










360 










365 








941 


Arg 


Ala 


Met 


Met 


Val 


Arg 


Thr 


Met 


Gly 


Gly 


Val 


Ser 


Ser 


Met 


Leu 


Pro 


942 




370 










375 










380 










944 


Lys 


Gly 


Gly 


Asp 


Val 


Ala 


Pro 


Asp 


Asp 


Leu 


Asn 


Gin 


Thr 


Asn 


Phe 


Ser 


945 


385 










390 










395 










400 


947 


Asn 


Gly 


Ala 


He 


He 


Arg 


Tyr 


Arg 


Glu 


Asp 


He 


Asp 


Lys 


Asp 


His 


Asp 


948 










405 










410 










415 




950 


Glu 


Phe 


Asp 


He 


Asp 


Asp 


Ala 


Leu 


Gin 


Phe 


Leu 


Lys 


Asn 


Val 


Thr 


Asp 


951 








420 










425 










430 






953 


Asp 


Asp 


Phe 


Lys 


Val 


Met 


Leu 


Ala 


Lys 


Asn 


Tyr 


Ser 


His 


Gly 


Leu 


Ala 
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RAW SEQUENCE LISTING DATE: 04/08/2002 

PATENT APPLICATION: US/09/53 7,710 TIME: 10:57:03 

Input Set : A:\EP.txt 

Output Set: N:\CRF3\04082002\I537710.raw 



954 






435 










440 










445 








956 


Trp 


Thr 


Glu 


Lys 


Glu 


Val 


Leu 


Lys 


Asn 


Asn 


Glu 


Met 


Pro 


Ser 


Lys 


Trp 


957 




450 










455 










460 










959 


He 


Asn 


Pro 


Leu 


Glu 


Thr 


Ser 


Leu 


Pro 


Tyr 


Ala 


Pro 


Asp 


Met 


Lys 


He 


960 


465 










470 










475 










480 


962 


Tyr 


Cys 


Val 


His 


Gly 


Val 


Gly 


Lys 


Pro 


Thr 


Glu 


Arg 


Gly 


Tyr 


Tyr 


Tyr 


963 










485 










490 










495 




965 


Thr 


Asn 


Asn 


Pro 


Glu 


Gly 


Gin 


Pro 


Val 


lie 


Asp 


Ser 


Ser 


Val 


Asn 


Asp 


966 








500 










505 










510 






,968 


Gly 


Thr 


Lys 


Val 


Glu 


Asn 


Gly 


He 


Val 


Met 


Asp 


Asp 


Gly 


Asp 


Gly 


Thr 


969 






515 










520 










525 








971 


Leu 


Pro 


He 


Leu 


Ala 


Leu 


Gly 


Leu 


Val 


Cys 


Asn 


Lys 


Val 


Trp 


Gin 


Thr 


972 




530 










535 










540 










974 


Lys 


Arg 


Phe 


Asn 


Pro 


Ala 


Asn 


Thr 


Ser 


He 


Thr 


Asn 


Tyr 


Glu 


He 


Lys 


975 


545 










550 










555 










560 


977 


His 


Glu 


Pro 


Ala 


Ala 


Phe 


Asp 


Leu 


Arg 


Gly 


Gly 


Pro 


Arg 


Ser 


Ala 


Glu 


978 










565 










570 










575 




980 


His 


Val 


Asp 


He 


Leu 


Gly 


His 


Ser 


Glu 


Leu 


Asn 


Glu 


He 


He 


Leu 


Lys 


981 








580 










585 










590 






983 


Val 


Ser 


Ser 


Gly 


His 


Gly 


Asp 


Ser 


Val 


Pro 


Asn 


Arg 


Tyr 


He 


Ser 


Asp 


984 






595 










600 










605 








986 


He 


Gin 


Glu 


He 


He 


Asn 


Glu 


He 


Asn 


Leu 


Asp 


Lys 


Pro 


Arg 


Asn 




987 




610 










615 










620 











2353 <210> SEQ ID NO: 8b 

2354 <211> LENGTH: 516 . \ 

2355 <212> TYPE: DNA fef^ AsJ^*^ 

2356 <213> ORGANISM: Neurospora crassa JV5 
2358 <400> SEQUENCE: 8b r A 

E--> 2359 ggtggcgaag acga^gcgg aagttggagg ctaacgagaa tgac^ctcgg agatggatct 60 
E--> 2360 accctctaga gacacgacta cc(n^tgcacc cagcctcaag gl^tac^tt tQ:atgggta 120 
2361 ggaagccgac ggagcgagcc tacatctatc. tggcgcccga tcccgggacg acaacgcatc 180 
E--> 2362 tttagatgac gatcgatacg actttgactonaggggcacat tgaccacggt gtgattttgg 240 

2363 gcgaaggcga tggcacagtg aaccttatga gtttggggta cctgtgcaat aaggggtgga 300 

2364 aaatgaagag atacaatcct gcgggctcaa aaataaccgt ggtcgagatg ccgcatgaac 360 
23 65 cagaacggtt caatccgaga ggagggccga atacggcgga cttaaatatg tagaaaaggt 4 20 

2366 tgaaatttat gaagagtaat taaatacggc acataggtta ctcaatagta tgactaatta 4 80 

2367 aaaaaaaatt ttttttctaa aaaaaaaaaa aaaaaa 516 
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L 


502 


M:340 E: 


L 


679 


M:336 W: 


L 


683 


M:336 W: 


L 


689 


M:258 W: 


L 


689 


M:341 W: 


L 


691 


M:258 W: 


L 


691 


M: 341 W: 


L 


725 


M:340 E: 


M 


340 


Repeated 


L 


871 


M:212 E: 


L 


2254 


M: 258 W 


L 


2254 


M: 341 W 


L 


2255 


M: 258 W 


L 


2255 


M:341 W 


L 


2266 


M:258 W 


L 


2266 


M:341 W 


L 


2267 


M:258 W 


L 


2267 


M:341 W 


L 


2313 


M:336 W 


L . 


2317 


M:336 W 


L: 


2323 


M: 258 W 


L: 


2323 


M: 341 W 


L: 


2325 


M: 258 W 


L: 


2325 


M: 341 W 


L: 


2359 


M: 340 E 


M: 


340 


Repeated 



VERIFICATION SUMMARY DATE: 04/08/2002 

PATENT APPLICATION: US/09/537,710 TIME: 10:57:04 

Input Set : A:\EP.txt 

Output Set: N:\CRF3\04082002\I537710.raw 

(46) "n" or "Xaa" used: Feature required, for SEQ ID# : 5 
Invalid Amino Acid Number in Coding Region, SEQ ID: 7 
Invalid Amino Acid Number in Coding Region, SEQ ID: 7 
Mandatory Feature missing, <223> not found for SEQ ID# : 7 
(46) n n" or "Xaa" used, for SEQ ID#:7 

Mandatory Feature missing, <223> not found for SEQ ID# : 7 
(46) "n" or "Xaa" used, for SEQ ID#:7 

(46) "n" or "Xaa" used: Feature required, for SEQ ID# : 9 
in SeqNo=9 

(34) Invalid or duplicate Sequence ID Number, SEQUENCE ID N0S:13 differs: 15 
Mandatory Feature missing, <*223> not found for SEQ ID# : 5 
(46) "n" or "Xaa" used, for SEQ ID# : 5 

missing, <223> not found for SEQ ID# : 5 
used, for SEQ ID# : 5 
missing, <223> not found 
used, for SEQ ID# : 5 
Mandatory Feature missing, <223> not found 
(46) "n" or "Xaa" used, for SEQ ID# : 5 
Invalid Amino Acid Number in Coding Region, SEQ ID : 6 
Invalid Amino Acid Number in Coding Region, SEQ ID: 6 
Mandatory Feature missing, <223> not found for SEQ ID# : ( 
(46) "n" or "Xaa" used, for SEQ ID# : 6 

Mandatory Feature missing, <223> not found for SEQ ID# : ( 
(46) "n" or "Xaa" used, for SEQ ID# : 6 

(46) "n" or "Xaa" used: Feature required, for SEQ ID#:8 
in SeqNo=8 



Mandatory Feature 
(46) "n" or "Xaa" 
Mandatory Feature 
(46) "n" or "Xaa" 



for SEQ ID#:5 
for SEQ ID#:5 
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